Supervised feature selection in mass spectrometry-based proteomic profiling by blockwise boosting

نویسندگان

  • Jan Gertheiss
  • Gerhard Tutz
چکیده

When feature selection in mass spectrometry is based on single m/z values, problems arise from the fact that variability is not only in vertical but also in horizontal direction, i.e. also slightly differing m/z values may correspond to the same feature. Hence, we propose to use the full spectra as input to a classifier, but to select small groups -- or blocks -- of adjacent m/z values, instead of single m/z values only. For that purpose we modify the LogitBoost to obtain a version of the so-called blockwise boosting procedure for classification. It is shown that blockwise boosting has high potential in predictive proteomics.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum

Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...

متن کامل

Comparison of Supervised Classification Methods for Protein Profiling in Cancer Diagnosis

A key challenge in clinical proteomics of cancer is the identification of biomarkers that could allow detection, diagnosis and prognosis of the diseases. Recent advances in mass spectrometry and proteomic instrumentations offer unique chance to rapidly identify these markers. These advances pose considerable challenges, similar to those created by microarray-based investigation, for the discove...

متن کامل

A Multi-objective Genetic Programming Biomarker Detection Approach in Mass Spectrometry Data

Mass spectrometry is currently the most commonly used technology in biochemical research for proteomic analysis. The main goal of proteomic profiling using mass spectrometry is the classification of samples from different clinical states. This requires the identification of proteins or peptides (biomarkers) that are expressed differentially between different clinical states. However, due to the...

متن کامل

Machine learning methods for predictive proteomics

The search for predictive biomarkers of disease from high-throughput mass spectrometry (MS) data requires a complex analysis path. Preprocessing and machine-learning modules are pipelined, starting from raw spectra, to set up a predictive classifier based on a shortlist of candidate features. As a machine-learning problem, proteomic profiling on MS data needs caution like the microarray case. T...

متن کامل

Mathematical Framework and Wavelets Applications in Proteomics for Cancer Study

Cancer is a proteomic disease. Though MALDI-TOF mass spectrometry allows direct measurement of the protein signature of tissue, blood, or their biological samples, and holds tremendous potential for disease diagnosis and treatment, key challenges remain in the processing of proteomic data. In this chapter, we will introduce a wavelet based mathematical framework and computational tools for prot...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Bioinformatics

دوره 25 8  شماره 

صفحات  -

تاریخ انتشار 2009